Linear Algorithms for Computing the Lyndon Border Array and the Lyndon Suffix Array

نویسندگان

  • Ali Alatabbi
  • Jacqueline W. Daykin
  • Mohammad Sohel Rahman
چکیده

We consider the problem of finding repetitive structures and inherent patterns in a given string s of length n over a finite totally ordered alphabet. A border u of a string s is both a prefix and a suffix of s such that u 6= s. The computation of the border array of a string s, namely the borders of each prefix of s, is strongly related to the string matching problem: given a string w, find all of its occurrences in s. A Lyndon word is a primitive word (i.e., it is not a power of another word) which is minimal for the lexicographical order of its conjugacy class (i.e., the set of words obtained by cyclic rotations of the letters). In this paper we combine these concepts to introduce the Lyndon Border Array Lβ of s, whose i-th entry Lβ(s)[i] is the length of the longest border of s[1 . . i] which is also a Lyndon word. We propose linear-time and linear-space algorithms 1 for computing Lβ(s). Further, we introduce the Lyndon Suffix Array, and by modifying the efficient suffix array technique of Ko and Aluru [KA03] outline a linear time and space algorithm for its construction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Algorithms to Compute the Lyndon Array

In the Lyndon array λ = λx[1..n] of a string x = x[1..n], λ[i] is the length of the longest Lyndon word starting at position i of x. The computation of λ has recently become of great interest, since it was shown (Bannai et al., The “Runs” Theorem [2]) that the runs in x are computable in linear time from λx. Here we first describe three algorithms for computing λx that have been suggested in th...

متن کامل

Lyndon Array Construction during Burrows-Wheeler Inversion

In this paper we present an algorithm to compute the Lyndon array of a string T of length n as a byproduct of the inversion of the Burrows-Wheeler transform of T . Our algorithm runs in linear time using only a stack in addition to the data structures used for Burrows-Wheeler inversion. We compare our algorithm with two other linear-time algorithms for Lyndon array construction and show that co...

متن کامل

Sorting Suffixes of a Text via its Lyndon Factorization

The process of sorting the suffixes of a text plays a fundamental role in Text Algorithms. They are used for instance in the constructions of the BurrowsWheeler transform and the suffix array, widely used in several fields of Computer Science. For this reason, several recent researches have been devoted to finding new strategies to obtain effective methods for such a sorting. In this paper we i...

متن کامل

A Thinning Method of Linear And Planar Array Antennas To Reduce SLL of Radiation Pattern By GWO And ICA Algorithms

In the recent years, the optimization techniques using evolutionary algorithms have been widely used to solve electromagnetic problems. These algorithms use thinning the antenna arrays with the aim of reducing the complexity and thus achieving the optimal solution and decreasing the side lobe level. To obtain the optimal solution, thinning is performed by removing some elements in an array thro...

متن کامل

Words over an ordered alphabet and suffix permutations

Given an ordered alphabet and a permutation, according to the lexicographic order, on the set of suffixes of a word w, we present in this article a linear time and space method to determine whether a word w′ has the same permutation on its suffixes. Using this method, we are then also able to build the class of all the words having the same permutation on their suffixes, first of all the smalle...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1506.06983  شماره 

صفحات  -

تاریخ انتشار 2015